A box-covering algorithm for fractal scaling in scale-free networks 
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Abstract 

A random sequential box-covering algorithm recently introduced to measure the fractal dimension in 
scale-free networks is investigated. The algorithm contains Monte Carlo sequential steps of choosing the 
position of the center of each box, and thereby, vertices in preassigned boxes can divide subsequent boxes 
into more than one pieces, but divided boxes are counted once. We find that such box-split allowance in 
the algorithm is a crucial ingredient necessary to obtain the fractal scaling for fractal networks; however, 
it is inessential for regular lattice and conventional fractal objects embedded in the Euclidean space. Next 
the algorithm is viewed from the cluster-growing perspective that boxes are allowed to overlap and thereby, 
vertices can belong to more than one box. Then, the number of distinct boxes a vertex belongs to is dis- 
tributed in a heterogeneous manner for SF fractal networks, while it is of Poisson-type for the conventional 
fractal objects. 
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A box-covering method is a basic tool to measure the fractal dimension of conventional 
fractal objects embedded in the Euclidean space. Such a method, however, cannot be ap- 
plied to scale-free networks that exhibit an inhomogeneous degree distribution and the small- 
worldness. The Euclidean metric is not well defined in such networks. To check the fractality, 
a random sequential box-covering algorithm was recently introduced. In the algorithm, ver- 
tices within a box can be disconnected, but connected via a different box or boxes. Here we 
show that such box-split allowance is an essential ingredient to obtain the fractal scaling in 
scale-free networks, while it is inessential for the conventional fractal objects. Moreover, the 
algorithm is viewed from a different perspective that boxes are allowed to overlap instead of 
being split and thereby, vertices can belong to more than one box. Then, the number of dis- 
tinct boxes a vertex belongs to is distributed in a heterogeneous manner for scale-free fractal 
networks, while it is of Poisson-type for the conventional fractal objects. 



I. INTRODUCTION 



Fracta 



objects that are embedded in the Euclidean space have been observed in diverse phenom- 



ena liO. They contain self- similar structures within them, which are characterized in terms of 



non-integer dimension, i.e., the fractal dimension dg, defined in the fractal scaling relation, 

N B {t B )~r B dB . (i) 

Here Nb(£b) is the minimum number of boxes needed to tile a given fractal object with boxes of 
lateral size £b- This counting method is called the box-covering method. 



Fractal scaling (OQ) was also observed recently 



in real-world scale-free (SF) networks such 



as the world-wide web yfl, metabolic network of Escherichia coli and other microorganisms |4|], 



are those that exhibit a 



and protein interaction network of Homo sapiens [|5|]. SF networks 
power-law degree distribution P c i(k) ~ Degree k is the number of edges connected to a given 
vertex. For such fractal networks, since their embedded space is not Euclidean, the Euclidean 
metric is replaced by the chemical distance. 

One may define the fractal dimension in another manner through the mass-radius relation. The 
average number of vertices (Mc(£c)) within a box of lateral size £c, called average box mass, 
scales in a power-law form, 

(M c (£c))~4 B , (2) 
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with the fractal dimension dg- This counting method is called the cluster- growing method below. 
Hereafter, the subscripts B and C represent the box-covering and the cluster-growing methods, 
respectively. The formulas (QQ) and © are equivalent when the relation N ~ Nb(£b){Mc(£c)) 
holds for £g = tc- Such case can be seen when fractal objects are embedded in the Euclidean 
space. However, for SF fractal networks, the relation © is replaced with the small- world behavior, 

(M c (£c))~A/ fo , (3) 

where £q is a constant. Thus, the fractal scaling can be found in the box-covering method, but not 
in the cluster-growing method for SF fractal networks. 

To understand this seemingly contradictory relations, here we investigate generic nature of the 
box-covering method in SF networks in comparison of the cluster-growing method. Owing to the 
inhomogeneity of degrees in SF fractal networks, the way of covering a network can depend on 
detailed rules of box-covering methods. Recently, a new box-covering algorithm was introduced 



by the current authors 03, ISA- In fact, this algorithm shares a common spirit with the one previously 
introduced by Song et al. B2D, however, details differ from one another in the following perspective: 
Our algorithm, called random sequential (RS) box-covering method, contains a random process 
of selecting the position of the center of each box. A new box can overlap preceding boxes. In 
this case, vertices in preassigned boxes are excluded in the new box, and thereby, vertices in the 
new box can be disconnected within the box, but connected through a vertex (or vertices) in a 
preceding box (or boxes). Nevertheless, such a divided box is counted as a single one. Detailed 
rule is described in the next section. Such counting method is an essential ingredient to obtain the 
fractal scaling in fractal networks; whereas, it is inessential for regular lattice and conventional 
fractal objects embedded in the Euclidean space. 

Next, we count how many boxes a vertex belongs to in the cluster-growing algorithm, where 
boxes are allowed to overlap. For the SF fractal network, the fraction of vertices counted / times 
decays with respect to / in a nontrivial manner, while for the square lattice and a conventional 
fractal object, it decays in a Poisson-type manner. We note that the Sierpinski gasket is used 
here as a fractal object embedded in the Euclidean space. Such distinct features arising in the 
SF fractal networks enables the coexistence of the two contradictory notions of the fractality and 
small-worldness. 
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FIG. 1: (Color online) Schematic illustration of the RS box-covering algorithm introduced fcj± Vertices 
are selected randomly, for example, from vertex 1 to 4 successively. Vertices within distance £b = 1 from 
vertex 1 are assigned to a box represented by solid (red) circle. Vertices from vertex 2, not yet assigned 
to their respective box are represented by dashed-doted-doted (black) closed curve, vertices from vertex 3 
are represented by dashed-doted (green) circle and vertices from vertex 4 are represented by dashed (blue) 
ellipse. 

II. RANDOM SEQUENTIAL BOX-COVERING 

Here we describe a new box-covering method, which takes steps as follows: We start with all 
vertices labeled as not burned. Then, 

(i) Select a vertex randomly at each step; this vertex serves as a seed. 

(ii) Search the network by distance £g from the seed and burned all vertices found but not burned 
yet. Assign newly burned vertices to the new box. If no newly burned vertex is found, the 
box is discarded. 

(iii) Repeat (i) and (ii) until all vertices are assigned to their respective boxes. 

The above method is schematically illustrated in Fig.CQ A different Monte Carlo realization of 
this procedure ((i)-(iii)) may yield a different number of boxes for covering the network. In this 
study, for simplicity, we choose the smallest number of boxes among all the trials. To obtain the 
power-law behavior of the fractal scaling, we needed at most (10) Monte Carlo trials for all frac- 
tal networks we study. It should be noted that the box number Ng we employ is not the minimum 
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FIG. 2: (Color online) Fractal scaling analysis for the two dimensional square lattice with the box-covering 
algorithm. Shown are the result of one Monte Carlo trial (o) and that obtained from 20 Monte Carlo trials 
(□). From the least-square-fit of the data (straight line), the fractal dimension is measured to be «2.0 as 
expected. 

number among all the possible tiling configurations. Finding the actual minimum number over all 
configurations is a challenging task, which could not be reached by the Monte Carlo method. 

To check the validity of our algorithm, we first apply our method to the two dimensional regular 
lattice in Fig. [2] While our method may perform inefficiently in the highly regular structure due to 
the step in which already box-assigned vertices can be selected as seeds, taking about a few hours 
of cpu time for system size N = 500 x 500, we find that our method can still yield the correct 
dimension ^2.0 for the two dimensional square lattice, as shown in Fig.[2l We also show that the 
number of Monte Carlo trials is not crucial to obtain the fractal scaling. 

Figj3] shows the box-covering method applied to the Sierpinski gasket with the third generation 
(a). One can find easily that Nb(£e = 2) = 3 in (b) and Nb(£e = 1) = 9 in (c) when lateral size ig 
is taken as the conventional Euclidean metric. The obtained Nb(£e) are the minimum numbers of 
boxes needed to tile the object for each case. In Figj3^d), we show a configuration in box-covering 
ensemble obtained from our current algorithm with distance £b = 1. One can see that Nb(£b) can 
vary depending on Monte Carlo trials. We show, however, that the fractal dimension is obtained 
by using the conventional method in (b) and (c), ds = — In 3/ In 2. Numerical value is obtained 
from the Sierpinski gasket with the 12th generation, composed of 265,721 vertices, and the fractal 
scaling is shown in Fig. HI 

Our algorithm also generates the same fractal dimensions for SF fractal networks such as the 
world-wide web, the metabolic network of E. coli, the protein interaction networks of H. sapiens 
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FIG. 3: (Color online) A fractal object, the Sierpinski gasket with the second generation (a). Conventional 
box covering based on the Euclidean metric with size 1% = 2 (b) and i E = \ (c). Box covering based 
on chemical distance is = 1 by using the RS box-covering method (d). Seed vertices 1 — > 5 are selected 
successively. 
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FIG. 4: (Color online)Fractal dimension of the Sierpinski gasket with the 12th generation measured by 
using the RS box-covering method. Solid line is guideline with slope — In 3/ In 2. Shown are the result of 
one Monte Carlo trial (o) and that obtained from 20 Monte Carlo trials (□). 

and S. cerevisiae as obtained by Song et al. |20- Figj5] shows the fractal scaling for the world- 
wide web, displaying the same fractal dimension ds ~ 4.1. In comparison of the method by Song 
et al., ours is easier to implement, because it does not contain the procedure for constraining the 
maximum separation within a box and is carried out in random sequential manner in box covering. 

The particular definition of box size has proved to be inessential for fractal scaling. It is rather 
inappropriate to compare the length scale £g with that is used in Ref. [21], because the two methods 
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FIG. 5: (Color online) Comparison of the box-covering methods introduced by Song et al. |[2|] (□) and in 
this paper (o) for the world-wide web. The results obtained from the two box-covering methods applied 
to the world-wide web are plotted here. The two methods yield the same fractal dimension ds ~ 4.1. The 
method introduced by Song et al. is more optimal than ours in the viewpoint that Nb{£s) < Nb{2£b + !)• 
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FIG. 6: (Color online) Fractal scaling analysis for the world-wide web by the two box-covering algorithms, 
that of Song et al. (□) and of ours (o). For comparison, the horizontal scale for that of Song et al. is rescaled 
as ^s/1.5 — > £b, by which we get the overlap of two curves obtained from the different algorithms. 

involve different definitions of the boxes. What is interesting, however, is that there exists a linear 
relationship, for example £5/1.5 — > £b in the case of the world-wide web as shown in Fig. [6] This 
linear relationship indicates that despite the difference in the two algorithms, such a difference 
does not lead to qualitatively different fractal- scaling behaviors. 
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FIG. 7: (Color online) Fractal scaling analysis for the world-wide web with the RS box-covering algorithm 
(o) and its variant that disallows disconnected boxes (□) 




1 2 4 8 16 32 64 128 256 




1 2 4 8 16 32 64 128 256 



FIG. 8: (Color online) Fractal scaling analysis for the square lattice (a) and the Sierpinski gasket (b) with 
the RS box-covering algorithm (o) and its variant that disallows disconnected boxes (□). Solid lines are 
guidelines with slopes of —2 in (a) and — In 3/ In 2 in (b). 

III. OVERLAP OF BOX COVERING AND VERTICES DISCONNECTEDNESS 

It is interesting to note that in our algorithm vertices can be disconnected within a box, but 
connected through a vertex (or vertices) in a different box (or boxes) as in the case of box 2 shown 
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in Fig.Q] On the other hand, if we construct a box with only connected vertices, for example, box 
2 is regarded as two separate boxes, then the power-law behavior Eq. (OQ) is not observed for the 
world-wide web as shown in Fig. |7J We check if such difference appears even for a regular lattice 
and a fractal object embedded in the Euclidean space. Figs. [8] show that such different behavior 
does not occur for the square lattice in two dimensions and the Sierpinski gasket. We show that 
such fact originates from the inhomogeneity of degrees in SF networks as follows: Owing to their 
large degree, hub vertices can be assigned to boxes earlier than other vertices when their neighbors 
are selected as seeds of boxes. Once hub vertices are assigned to one of the boxes, they can make 
subsequent boxes disconnected when vertices in those boxes are connected via hub vertices. Box 
2 in Fig. [T]is such a case. In SF networks, such cases occur at a non-negligible rate. 

To study the fraction of disconnected boxes quantitatively, we invoke the cluster-growing ap- 
proach. In this approach, boxes are allowed to overlap, and thereby, a vertex can belong to more 
than one box. Thus, the extent of overlap of the boxes during the tiling can provide important 
information on the fraction of disconnected boxes in the box-covering method. In this regard, we 
reported the cumulative fraction F c (f) of vertices counted / times or more in the cluster- growing 
method for the world-wide web in QSU and is reproduced in Fig.[9l The cumulative fraction F c (f) 
is likely to follow a power law for small /, thereby indicating that the overlaps occur in a non- 
negligible frequency even for a small distance lc- The associated exponent decreases with increase 
in box size Iq as the chances of overlaps increase. However, for large values of /, the large frac- 
tion of vertices counted exceed the frequency extrapolated from the power-law behavior. For the 
square lattice and the Sierpinski gasket, however, the fraction F(f) follows a bounded distribution 
with a peak at small / as shown in Fig.|T0l Thus, for the fractal networks like the world-wide web, 
there are a significant number of vertices that are counted quite a few times in the cluster-growing 
method, but such vertices are extremely rare in the conventional fractal objects such as the Sier- 
pinski gasket. Such multiple counting due to overlap is excluded in the box-covering method. This 
exclusion effect makes the average mass of a box in the box-covering method significantly lower 
than that in the cluster-growing method. 

Next, one may wonder if the RS box-covering algorithm can be improved in efficiency by 
excluding already-burned vertices from the list of the root candidates of new boxes. In Fig. [TTJ 
we compare the fractal scaling behaviors obtained from the two cases of keeping or excluding 
already-burned vertices from the list for two networks: the world-wide web (a) and the fractal 
model introduced in [71. We find that the two cases exhibit somewhat different behaviors. If 
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FIG. 9: (Color online) Cumulative fraction F c (f) of the vertices counted / times in the cluster-growing 
algorithm. F c {f) follows a power law in the small / region, where the slope depends on box size ic- 
However, for large values of /, the data largely deviate from the value extrapolated from the power-law 
behavior. Data are presented for ic = 2 (•), i c = 3 (■), and ic = 5 (A). 



0.8 




f 



FIG. 10: (Color online) Fraction F(f) of the vertices counted / times in the cluster-growing algorithm for 
the Sierpinski gasket with the 12th generation composed of 265,721 vertices. F(f) follows a Poisson-type 
distribution. Data are presented for £c = 2 (•), i c = 3 (■), and ic = 5 (A). 

the already-burned vertices are excluded from the next selection, the power-law behavior is not 
obtained for the world-wide web. However, they exhibit similar power-law behaviors for the 
fractal model, even though the two data sets show somewhat deviations. 
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FIG. 11: (Color online) Fractal scaling analysis with the rules that allow (o) or disallow (□) the already- 
box-burned vertices to be chosen as the roots of new boxes for the world-wide web (a), and the fractal model 
(b). 

IV. CONCLUSIONS AND DISCUSSION 

We have studied various features of the random sequential box-covering algorithm by applying 
it to a SF fractal network, the world-wide web, and a regular and a conventional fractal object, 
the square lattice and the Sierpinski gasket, respectively. Results obtained from the two classes 
of networks exhibit distinct feature. The condition that vertices in a box can be disconnected 
in the box-covering method turns out to be an essential ingredient to have the fractal scaling 
for a SF fractal network, however, it is irrelevant for a regular lattice and a conventional fractal 
object embedded in the Euclidean space. We also found that the fraction of vertices counted 
/ times in the cluster-growing method exhibits a non-trivial behavior for the former, while it 
does a trivial behavior for the latter. The two results are complementary; thereby, the SF fractal 
network exhibits the fractal scaling (0Q) in the box-covering and the small-world behavior © 
in the cluster-growing method. Finally, it is noteworthy that our box-covering algorithm is a 
modification of the algorithm used in the random sequential packing problem [^]. 

This work was supported by KRF Grant No. R14-2002-059-010000-0 of the ABRL program 
funded by the Korean government (MOEHRD). Notre Dame's Center for Complex Networks 
kindly acknowledges the support of the National Science Foundation under Grant No. ITR DMR- 



11 



0426737. 



[1] J. Feder, Fractals (Plenum, New York, 1988). 

[2] C. Song, S. Havlin, and H.A. Makse, Nature (London) 433, 392 (2005). 

[3] R. Albert, H. Jeong, and A.-L. Barabasi, Nature (London) 401, 130 (1999). 

[4] H. Jeong, B. Tombor, R. Albert, Z.N. Oltvai, and A.-L. Barabasi, Nature (London) 407, 651 (2000). 

[5] I. Xenarious, et al. Nucleic Acids Res. 28, 289 (2000). 

[6] A.-L. Barabasi and R. Albert, Science 286, 509 (1999). 

[7] K.-I. Goh, G. Salvi, B. Kahng and D. Kim, Phys. Rev. Lett. 96, 018701 (2006). 

[8] J.S. Kim, K.-I. Goh, G. Salvi, E. Oh, B. Kahng, and D. Kim, to appear in Phys. Rev. E. 

[9] M. Nakamura, J. Phys. A. 19, 2345 (1986); J.W. Evans, ibid 20, 3063 (1987). 



12 



